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Q (57) Abstract: The present invention provides compositions and methods for identifying candidate compounds capable of altering 
a phenotype of a cell that mediate their activity by binding with a specific chaperone of interest. Also provided are methods and 
^ compositions for identifying the target of the candidate compound-chaperone complex. 
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METHODS OF SCREENING CYCLIC PEPTIDES AND IDENTIFYING 
TARGETS THEREFOR 

5 

CROSS REFERENCES TO RELATED APPLICATIONS 

[0001] This application claims the benefit of U.S. Provisional Application No. 60/407,385, 
filed August 30, 2002, the entire contents of which is incorporated herein by reference. 

FIELD OF THE INVENTION 

10 [0002] The present invention relates to improved methods of screening libraries of cyclic 
peptides that utilizes a chaperone molecule capable of binding a portion or region of the 
cyclic peptide library members. Use of the chaperone permits the identification of cyclic 
peptides that alter a phenotype of a cell when complexed with the chaperone, as well as 
isolation and identification of the target protein with which the complex interacts. 

15 BACKGROUND OF THE INVENTION 

[0003] The recent advent of myriad library screening methodologies permits the rapid 
screening of libraries of thousands, and in many instances, even millions or trillions, of 
compounds to identify those compounds within the library having a desired property, such as 
for example, the ability to bind a particular target protein of interest or the ability to agonize 

20 or antagonize a cellular function of interest. These latter functional screening assays are of 
particular importance, as they permit the identification of compounds capable of effecting a 
certain biological response, such as the ability to alter a phenotype of a cell in a particular 
fashion, without knowledge of the target proteins responsible for the response. One such 
functional assay utilizes cell lines (typically human) stably transfected with a construct 

25 comprising a reporter gene operably linked to an element, such as a promoter, responsive to a 
specific effector molecule of interest. Agonists or antagonists of the specific effector 
molecule, or a molecule in a complex signal transduction pathway including the effector 
molecule, can be readily identified by administering a candidate compound to the stably 
transfected cells and monitoring the expression, either at the transcriptional or translational 
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level, of the reporter gene. Other functional assays bypass the need to create such constructs 
by monitoring cells comprising candidate compounds for detectable or observable changes in 
phenotype, which can range from a change in cellular morphology, viability, growth, etc. to 
changes in the expression of one or more endogenous gene products, to changes in the ability 
5 of the cell to transmit signals intracellularly or intercellularly to other cells. 

[0004] Screening molecular libraries of compounds for the ability to alter a specific 
phenotype of a cell, such as the ability to modulate intra- or intercellular signal transduction 
cascades or pathways, has led to discoveries of significant clinical importance. For example, 
both cyclosporin A (CsA) and FK506, which are potent immunosuppressants administered to 

10 organ transplant recipient to prevent organ rejection, were identified in pharmaceutical 

screening assays designed to detect inhibition of T-cell activation. Although CsA and FK506 
bind different target proteins — cyclophilin A (CypA) and FK506 binding protein (FKBP), 
respectively — the biological effect of both compounds is virtually the same: profound and 
specific suppression of T-cell activation, phenotypically observable in T-cells as inhibition of 

15 mRNA production dependent upon transcription factors such as NF-AT and NF-kB. 

[0005] One class of molecules that has been shown to play an important role in modulating 
signaling pathways is peptides. The literature is replete with examples of peptides capable of 
modulating such pathways. For example, a peptide derived from the HLV-l envelope protein 
has been shown to block the action of cellular calmodulin. Peptides corresponding to regions 
20 of the Fas cytoplasmic domain have been shown to have death-inducing or G protein 

inducing functions. Peptides corresponding to the protein kinase C isozyme jSPKC block 
nuclear translocation of /3PKC in Xenopus oocytes following stimulation. Moreover, the two 
immunosuppressants discussed above, CsA and FK506, although not composed entirely of 
genetically encoded amino acids, are cyclic peptides. 

25 [0006] These few examples highlight the importance of peptides, both linear and cyclic, as 
candidate compounds in pharmaceutical screening assays. Cyclic peptides are particularly 
attractive candidate compounds because they generally have greater bioavailability and are 
more resistant to in vivo degradation by enzymes such as proteases than their linear 
counterparts. To this end, several methodologies suitable for delivering and screening 

30 libraries of linear and cyclic peptides for functional activity have been developed. Two 
exemplary assay systems are described in U.S. Patent No. 6,153,380 and WO 01/66565. 
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Both of these exemplary systems utilize retroviral vectors to deliver libraries of 
polynucleotides to target cells to screen for and identify expression products of the 
polynucleotides that alter a phenotype of the target cells. The system described in WO 
01/66565 is specifically adapted to screening libraries of expressed cyclic peptides. 

5 [0007] Although powerful, assay systems designed to screen cyclic peptides are not without 
drawbacks. Once a candidate compound having a desired biological property of interest is 
identified, it is often desirable to identify the target molecule(s) with which the identified 
compound interacts. For linear peptides this is typically accomplished using yeast two- 
hybrid assays or variants of such assays. However, cyclic peptides are not amenable to yeast 
10 two-hybrid assays. As a consequence, identifying targets that interact with biologically 
active cyclic peptides provides significant challenges. 

[0008] Random functional screening assays provide significant challenges, as well. It is well 
established that intracellular signaling occurs through a series of protein-protein interactions. 
In many instances, candidate compounds such as linear and peptides and cyclic peptides may 

1 5 not have enough size or bulk to disrupt or effect such protein-protein interactions. In fact, the 
art is replete with examples of small molecules, for example cyclic peptides, that exert their 
biological effect when complexed with larger "chaperone" proteins. Presumably, the 
specificity of the biological effect is provided by the small molecule and the chaperone 
protein provides bulk and a larger interaction surface than that provided by the small 

20 molecule alone. Specific examples of small molecules that exert their biological activity 
when complexed with larger chaperone proteins are the immunosuppressant drugs 
cyclosporin A (CsA), FK506 and Rapamycin ("Rap"). The immunosuppressant activity of 
both CsA and FK506 derives from the formation of a complex with their respective 
immunophilins (CypA and FKBP, respectively). The CsA-CypA and FK506-FKBP 

25 complexes inhibit or interfere with cytoplasmic signaling cascades leading to transcription of 
lymphokine genes in T-cells (Schreiber, 1991, Science 251:283-287; Bierer et al, 1991 Proc. 
Nad. Acad. Sci. USA 87:9231-9235; Dumont etaL, 1990, J. Immunol. 144:251-258), 
presumably by binding to, and inhibiting the activity of, the phosphatase calcineurin (see, 
e.g., Cameron, et aL, 1997, J. Biol. Chem. 272:27582-27588). Similarly, a complex formed 

30 between FKBP and Rap, which is structurally related to FK506, inhibits growth factor- 
dependent signaling cascades that, if not so inhibited, lead to proliferation of T-cells 
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(Schreiber, 1992, Tetrahedron 48:2545-2558). The Rap-FKBP complex has also been shown 
to inhibit the phosphorylation of a 70 kD S6 protein kinase, leading a decrease in protein 
synthesis accompanying cell cycle progression (Calvo et al 9 1992, Proc. Natl. Acad. Sci. 
USA 89:7571-7575; Chung et a/., 1992, Cell 69:1227-1236; Kuo et a/., 1992, Nature 358:70- 
5 73; Price et a/., 1992, Science 257:973-977). The dephosphorylation of p70S 6K is believed to 
be mediated by the formation of a ternary complex of FKBP12, Rap and FKBP 1 2-rapamycin- 
associated protein (FRAP) (Desai, et al. 9 2001, Proc. Natl. Acad. Sci. 99:4319-4324). The 
Rap-FKBP complex also inhibits and to inhibit activation of cdk2/cyclin E complex 
(Flanagan et al. 9 1993, J. Cell Biochem. 17A:292). In all of these cases, the activities of CsA, 
10 FK506 and Rap are not mediated by direct binding to the effector molecule, but by action of a 
CypA-, FK506- or Rap-immunophilin complex on upstream component(s) which regulate the 
effector molecule. 

[0009] Quite interestingly, the biological effects of FK506-FKBP and RaP-FKBP complexes 
are distinct. FKBP12 has been shown to bind a specific lencine-proline sequence (1400- 

15 1401) of the inositol 1,4,5-triphosphate receptor (IP3R) and anchors calcincurin to this 
domain (see, e.g., Cameron et al, 1997, J. Biol. Chem. 272:27582-27588). The Leu-Pro 
structure resembles a portion of the structure common to both FK506 and Rap. The 
similarity in structure between the FKBP 12 binding region of EB3R, FK506 and Rap suggests 
that this structure is essential to the binding of FK506 and Rap to FKBP12. The differences 

20 in the biological activities FK506 and Rap suggest that the remainder of the molecules hold 
the key to their specificities. Presumably, the FKBP, and CyPA in the case of CsA, act as 
"chaperones" that "present* ' the CsA, FK506 or Rap to the targets with which the respective 
complexes interact 

[0010] It would be highly desirable to have available screening methods that are able to take 
25 advantage of the ability of such "chaperone" proteins, such as immunophilins, to "present" 
small candidate compounds to target molecules to effect or disrupt protein-protein 
interactions in order to identify candidate compounds having specified biological activities, 
such as the ability to alter a phenotype of a cell. Moreover, it would be highly desirable to 
have available screening methods which aid the identification of the target proteins with 
30 which the identified candidate compound interacts to exert its biological efifect(s). 
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SUMMARY OF THE INVENTION 

[0011] These and other objects are furnished by the present invention, which provides 
methods and compositions for screening candidate compounds, such as peptides and cyclic 
peptides, that take advantage of the ability of certain proteins to act as "chaperones" to 
5 "present" biologically active molecules such as cyclic peptides to target molecules. The 
methods provide for the identification of candidate compounds that alter a phenotype of a 
target cell when complexed with a specified chaperone protein, and also provide a means for 
isolating and identifying the target protein with which the candidate compound-chaperone 
complex interacts. In preferred embodiments, the methods and compositions provide for the 
10 identification of candidate compounds that modulate, and in particular inhibit, intra- and/or 
extracellular signal transduction cascades when complexed with immunophilins such as FK 
binding proteins and cyclophilins. 

[0012] Accordingly, in one aspect, the present invention provides a method of identifying 
compounds that are capable of altering a phenotype of a cell by binding or interacting with a 

1 5 specified chaperone protein. The method generally comprises administering to a cell a 

candidate compound of interest that comprises two regions or domains: a chaperone binding 
region and a putative target binding region. The structure of the chaperone binding region is 
known and is designed to bind a particular chaperone of interest. The structure of the 
putative target binding region is typically unknown, either in whole or in part, and constitutes 

20 the portion of the candidate compound being assessed for potential biological activity. The 
cell is then assessed to determine whether a phenotype of the cell has been altered. Cells 
comprising active candidate compounds can be analyzed to determine the identity of the 
active candidate compound, and in particular the structure of its putative target binding 
region. 

25 [0013] In one embodiment of the method, the candidate compound is a cyclic peptide in 
which one end of the chaperone binding region is fused, either directly or by way of an 
optional linker, to one end of the putative target binding region and the other end of the 
chaperone binding region is fused, either directly or by way of an optional linker, to the other 
end of the putative binding region. The amino acid sequence of the putative target binding 

30 region may be completely random, or it may include one or more sub regions of specified 
sequence(s). In another embodiment of the method, the chaperone binding region of the 
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candidate compound is capable of binding an immunophilin, such as, for example an FK 
binding protein or a cyclophilin. 

[0014] In embodiments of the method that employ synthetic candidate compounds that are 
capable of traversing cell membranes, the candidate compounds may be administered to the 
5 cell by contacting the cell with the candidate compound. In embodiments of the method that 
employ candidate compounds that can be expressed or transcribed by polynucleotides, such 
as peptides or RNAs, the candidate compound can be administered to the cell via a 
polynucleotide capable of expressing or transcribing the candidate compound in the cell. In 
such embodiments, the sequences of active candidate compounds, and in particular the 
10 sequences of their respective putative target binding regions, can be determined by 
sequencing the polynucleotides of cells exhibiting an altered phenotype. 

[0015] In another aspect, the present invention provides a method of identifying a target 
molecule with which an identified candidate compound-chaperone complex interacts. The 
method generally comprises administering to a cell a candidate compound which comprises 
15 two regions: a chaperone binding region and a putative target binding region, as previously 
described. The cell is then assessed for the presence or absence of an altered phenotype, as 
previously described. A cell lysate of a cell having an altered phenotype is contacted with an 
agent, which is preferably immobilized on a solid support, which specifically binds the 
chaperone. Any target protein bound to the chaperone is then isolated and characterized. 

20 [001 6] hi yet another aspect, the present invention provides libraries of candidate compounds, 
each of which includes chaperone binding region of known structure and a putative binding 
region of unknown structure (either in whole or in part), polynucleotide libraries capable of 
expressing, either at the transcription or translation level, RNA or peptide embodiments of 
such candidate compounds, retroviral libraries comprising the polynucleotide libraries and 

25 cellular libraries comprising the retroviral libraries. In one embodiment of the various 

libraries, the chaperone binding region of each candidate compound is the same. In another 
embodiment of the various libraries, the putative binding region of each candidate compound 
is distinct. 

[0017] The methods and compositions of the invention provide myriad advantages over 
30 traditional screening techniques. The complexes formed between the candidate compounds 
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and the chaperone protein are larger than the candidate compound alone, thus providing bulk 
upon binding that may inhibit the activity of the target via steric hindrance, even in instances 
where the candidate compound does not bind directly to an active site. Moreover, in 
instances where the chaperone protein is highly expressed, such as when the chaperone is an 
5 immunophilin, the identified candidate compounds will make ideal leads for pharmaceutical 
development. Coadministration of the chaperone should not be necessary — owing to its high 
expression, the active complex will form in situ following administration of the identified 
candidate compound. Moreover, since only a fraction of the chaperone will be occupied by 
the cyclic peptide, it will be free to canry out its other biological functions. 

10 [0018] The methods and compositions of the invention also provide an easy, efficient means 
of identifying the target with which the complex interacts. Since the chaperone is known, 
active target-candidate compound-chaperone complexes can be isolated using affinity 
reagents specific for the chaperone and the target dissociated there from and characterized. 

BRIEF DESCRIPTION OF THE FIGURES 

15 [0019] FIG. 1 provides a cartoon illustrating the principles of the invention; 

[0020] FIG. 2 provides a cartoon illustrating the protein splicing reaction mediated by a 
contiguous intein; 

[0021] FIG. 3 provides a cartoon illustrating the presumed mechanism of action of the protein 
splicing reaction of FIG. 2; 

20 [0022] FIG. 4 provides a cartoon illustrating the protein splicing reaction mediated by a split 
intein to generate a candidate cyclic peptide; 

[0023] FIG. 5 provides a cartoon illustrating the presumed mechanism of action of the protein 
splicing reaction of FIG. 4; 

[0024] FIG. 6 provides a cartoon illustrating a candidate polynucleotide intein construct 
25 capable of expressing a candidate cyclic peptide; and 
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[0025] FIG. 7 provides the nucleotide sequence (coding strand) and translated amino acid 
sequence of the fc-extein-lN-BFP region of the retroviral vector of FIG. 6 (stop codons are 
indicated with"."). 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

5 Abbreviations 

[0026] The abbreviations used for the genetically encoded amino acids are conventional and 
are as follows: 



Amino Acid 


Three-Letter 


One-Letter 


Alanine 


Ala 


A 


Arginine 


A mm ~ 

Arg 


R 


Asparagine 


Asn 


XT 

N 


Aspartate 


Asp 


D 


Cysteine 


Cys 


C 


Glutamate 


GLu 


E 


Glutamine 


Gin 


Q 


Glycine 


Gly 


G 


Histidine 


His 


H 


Isoleucine 


De 


I 


Leucine 


Leu 


L 


Lysine 


Lys 


K 


Methionine 


Met 


M 


Phenylalanine 


Phe 


F 


Proline 


Pro 


P 


Serine 


Ser 


S 


Threonine 


Thr 


T 


Tryptophan 


Tip 


W 


Tyrosine 


Tyr 


Y 


Valine 


Val 


V 



10 [0027] When the three-letter abbreviations are used, unless specifically preceded by an "L" or 
a or clear from the context in which the abbreviation is used, the amino acid may be in 
either the L- or D-configuration about a-carbon (Co). For example, whereas "Ala" 
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designates alanine without specifying the configuration about the cc-carbon, c< D-AIa" and "L- 
Ala" designate D-alanine and L-alanine, respectively. When the one-letter abbreviations are 
used, upper case letters designate amino acids in the L-configuration about the a-carbon and 
lower case letters designate amino acids in the D-configuration about the a-carbon. For 
5 example, "A" designates L-alanine and "a" designates D-alanine. When peptide sequences 
are presented as a string of one-letter or three-letter abbreviations (or mixtures thereof), the 
sequences are presented in the N-»C direction in accordance with common convention. 

[0028] The abbreviations used for the genetically encoding nucleosides are conventional and 
are as follows: adenosine (A); guanosine (G); cytidine (C); thymidine (T); and uridine (U). 
Unless specifically delineated, the abbreviated nucleotides may be either ribonucleosides or 
2'-deoxyribonucleosides. The nucleosides may be specified as being either ribonucleosides 
or 2'-deoxyribonucleosides on an individual basis or on an aggregate basis. When specified 
on an individual basis, the one-letter abbreviation is preceded by either a "d" or an "r," where 
"d" indicates the nucleoside is a 2 , -deoxyribonucleoside and "r" indicates the nucleoside is a 
ribonucleoside. For example, "dA" designates 2 , -deoxyriboadenosine and "rA" designates 
riboadenosine. When specified on an aggregate basis, the particular nucleic acid or 
polynucleotide is identified as being either an RNA molecule or a DNA molecule. 
Nucleotides are abbreviated by adding a "p" t0 represent each phosphate, as well as whether 
the phosphates are attached to the 3'-position or the 5 '-position of the sugar. Thus, 5'- 
nucleotides are abbreviated as "pN" and 3'-nucleotides are abbreviated as "Np where "N" 
represents A, G, C, T or U. When nucleic acid sequences are presented as a string of one- 
letter abbreviations, the sequences are presented in the 5'->3' direction in accordance with 
common convention^ and the phosphates are not indicated. 

The Preferred Embodiments 

25 [0029] As discussed in the Background section, many intracellular and intercellular 

processes, such as intra- and intercellular signaling, occur through a series of protein-protein 
interactions. Oftentimes, such protein-protein interactions are mediated via proteins or 
molecules that act as "chaperones" for smaller molecules such as cyclic peptides. Specific 
examples of such chaperone molecules include immunophilins such as FK binding proteins 

30 and cyclophilins, which bind cyclic peptides such as CsA, FK506 and Rap to form binary 
complexes which then interact with and modulate the activity(ies) of other biological 
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molecules or complexes. The present invention provides compositions and methods that 
capitalize on the activities of such chaperone molecules to screen for and identify candidate 
compounds that alter a phenotype of a cell, such as, for example, by altering an intracellular 
or intercellular signaling pathway or cascade when bound to the chaperone molecule. Little 
5 or no knowledge of the pathway is required. The only requirement is that the biological 
process being investigated produce an observable physiologic or phenotypic change in the 
target cell. The invention also provides for the isolation of the constituents of the pathway, 
and in particular the target molecules with which the candidate compound-chap erone 
complex interacts to mediate its activity, the tools to characterize the pathway, and lead 
1 0 compounds for pharmaceutical development. 

[0030] The principles of the methods of the invention are illustrated in FIG. 1. Referring to 
FIG. 1, a candidate compound 100, which in this case is a cyclic peptide, is administered to 
cells 140 expressing a chaperone 120, which in preferred embodiments is a highly expressed 
protein such as an immunophilin (discussed in more detail, below), to yield cell 142. The 

1 5 cells also express numerous putative target proteins, exemplified by effector molecule 130, 
the activity of which can be modulated by interacting with a complex including chaperone 
120. As a specific example, effector molecule 130 is a protein involved in an intracellular 
signaling cascade. Candidate cyclic peptide 100 comprises two regions: a chaperone binding 
region 112 and a putative target binding region 114. The chaperone binding region includes a 

20 sequence known to interact with chaperone 120 (described in more detail, below). The 

putative target binding region 114 is typically a sequence that is random or semi-random, as 
will be discussed in more detail below. The chaperone binding and putative target binding 
regions may be fused together directly or, as illustrated in FIG. 1, may be spaced apart from 
one another via linkers 116, which may be the same or different, A different candidate cyclic 

25 peptide (not shown), which includes the same chaperone binding region as candidate cyclic 
peptide 100 but has a different putative target binding region, is administered to cells 140 to 
yield cell 144. As illustrated, once inside cells 142 and 144, the candidate cyclic peptides 
bind chaperone 120 via their respective chaperone binding regions to form binary candidate 
cyclic peptide-chaperone ("CCP-C") complexes 122 and 124, respectively. When bound to 

30 chaperone 120, their respective different putative target binding regions, 114 and 118, are 

free to interact with other molecules in the cell, such as effector molecule 130. As illustrated, 
CCP-C complex 122 binds putative target 130 to yield a ternary complex 126. When bound 
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by CCP-C complex 122, the biological activity of effector molecule 130 is modulated, 
leading to an observable phenotypic change in cell 142 (illustrated as cell 142'). However, 
CCP-C complex 124 does not bind effector molecule 130, leaving it to carry out its normal 
biological function(s). As a consequence, cell 144 does not exhibit a change in phenotype. 

5 [0031] As exemplified by FIG. 1, by varying the structure or sequence of the putative binding 
region of the candidate cyclic peptide, cyclic peptides which modulate biological processes 
can be rapidly identified. Significantly, such cyclic peptides can be identified without any 
prior knowledge about the effector molecule. Moreover, since the complex formed between 
the chaperone and cyclic peptide provides more bulk than the cyclic peptide alone, the 

10 present methods are capable of identifying active cyclic peptides that may be missed in 

traditional screening methods that do not utilize chaperones. When used in a library format 
to screen a library of candidate cyclic peptides, each of which includes a unique putative 
target binding region, the methods of the invention provide a powerful means of identifying 
cyclic peptides capable of modulating biological processes, many of which may have been 

1 5 missed using traditional screening techniques. 

[0032] Moreover, the present methods also permit isolation and identification of the target of 
the CCP-C complex (e.g., effector molecule 130 of FIG. 1). Since the chaperone to which 
the candidate cyclic peptides bind is known, cells exhibiting a change of phenotype (e.g., cell 
142' of FIG. 1) can be lysed, the ternary complex 126 isolated from the lysate using, for 
20 example, affinity reagents that specifically bind chaperone 120. Any target molecule {e.g., 
effector molecule 130 of FIG. 1) bound to the CCP-C complex 126 can be dissociated from 
the ternary complex and characterized. As a specific example, ternary complex 126 can be 
isolated from a lysate of cells 142' using immunoaffinity chromatography employing an 
antibody that specifically binds chaperone 120. 

25 [0033] A key feature of the methods of the invention is the use of a chaperone that effectively 
increases the size of the candidate cyclic peptide and also provides a greater number of points 
of contact for effecting protein-protein interactions with target molecules than the cyclic 
peptide alone. For example, as evidenced by complex 122 of FIG. 1, this complex has a 
greater surface area for interacting with target proteins than cyclic peptide 100 alone. As 

30 used herein, a "chaperone" may be any molecule or protein capable of ''presenting" a cyclic 
peptide to another molecule. Thus, "chaperones" are intended to include those proteins 
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commonly known in the art as chaperones, as well as any other proteins or molecules capable 
of binding a candidate cyclic peptide and presenting it to another molecule, as described 
herein. Typically, such a chaperone molecule will be a protein that is known to, or is 
suspected of, being involved in or mediating protein-protein interactions. Numerous proteins 
5 of this type are known in the art, and include by way of example and not limitation, classical 
chaperone proteins (e.g., heat-shock proteins), immunophilins, proteases and other enzymes, 
etc. Specific examples of proteins useful as chaperones in the present methods include, but 
are not limited to: 



Protein Name 




FK506 binding proteinlO 


gi|21361895|ref|NP 068758.2|[21361895] 


heat shock 70kD protein binding 
protein; progesterone receptor- 
associated p48 

protein; putative tumor suppressor 
ST13; Hsp70-interacting protein 


gi| 1 9923 1 93 [reflNP_003 923 .2 1[ 1 9923 1 93] 


peptidylprolyl isomerase-like protein 
3 isoform PPIL3b; cyclophilin-like 
protein 

3; peptidylprolyl cis-trans 
isomerase-like protein 3; PPIase-like 
protein 3 


gi|19557640[refINP - 572028.1|[19557640] 


peptidylprolyl isomerase-like protein 
3 isoform PPIL3b; cyclophilin-like 
protein 

3; peptidylprolyl cis-trans 
isomerase-like protein 3; PPIase-like 
protein 3 


gi|19557636|reflNP_57098U|[19557636] 


FLJ32150 protein; novel protein 
similar to KIAA0417 


gi|18677724|reflNPj443202.2|[18677724] 


peptidylprolyl isomerase-like protein 
3 isoform PPIL3a; cyclophilin-like 
protein 

3; peptidylprolyl cis-trans 
isomerase-like protein 3; PPIase-like 
protein 3 


gi| 1 4277 1 26|refINP_l 15861.1 1[ 14277 126] 


heat shock 70kD protein 2; Heat- 
shock 70kD protein-2 


gi|13676857)ref)NP - 068814.2|[13676857] 
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Protein Name 




stress 70 protein chaperone, 
microsome-associated, 60kD; Stress 
70 protein 

chaperone, microsome-associated, 
p60; human microsomal stress 70 
protein ATPase 
core 


gi|5902126)ref|NP,008879.1|[5902126] 


heat shock 70kD protein 6 
(HSP70B'); Heat-shock 70kD 
protein-6 (HSP70B') 


gi|45045 1 5(ref]NP_002 1 46. 1 1[45045 1 5] 


1 50 kDa oxygen-regulated protein 
precursor (Orpl50) 


gi|22063141|reflXP^006464.8|[22063141] 


Heat shock cognate 71 kDa protein 


gi|22061946|ref]XP 171 626. 1|[2206 1946] 


heat shock 70kD protein 1-like 


gi|22056764|ref|XP_l 75 1 77. 1 1[22056764] 


heat shock 70kD protein 8; Heat 
shock cognate protein 70 


gi|22055166|ref]XPJ)66483.3|[22055166] 


heat shock protein 


gi}22055 141 |reflXP_l 75247. 1 )[22055 141] 


heat shock protein 


gi|20555572)reflXP_ 166348. 1|[20555572] 


heat shock 70kD protein 8; Heat 
shock cognate protein 70 


gi|20550194|ref]XP_090256.4|[20550194] 


HEAT SHOCK 70 KDA PROTEIN 
4 (HEAT SHOCK 70-RELATED 
PROTEIN APG-2) 
(HSP70RY) 


gi|20547107|refpCPJ14482.1|[20547107] 


heat shock 70kD protein 8; Heat 
shock cognate protein 70 


gi|20540302|reflXP J 66540. 1 1[20540302] 


heat shock 70kD protein 8; Heat 
shock cognate protein 70 


gi( 1 8595470|reflXP_088650. 1 1[18595470] 


Hypothetical protein KIAA0417 


gi)18576822|reflXP 048898.3)[18576822] 


65 kDa Yes-associated protein 
CYAP65) 


gi|18562439|ref]XP_094606.1j[18562439] 


Heat shock cognate 71 kDa protein 


gi|22048365|reflXP 068577.2|[22048365] 


HEAT SHOCK 70 KD PROTEIN 6 
(HEAT SHOCK 70 KD PROTEIN 
B) 


gi|22045701|ref)XP -> 084070.5|[22045701] 


heat shock 70kD protein 8; Heat 
shock cognate protein 70 


gi|22043849|ref]XP^067703.5|[22043849] 


protease; reverse transcriptase; 
RNaseH; integrase; dUTPase; 
Pro-Pol-dUTPase polyprotein 


gi|22043778|ref!XP_065747.4|[22043778] 


heat shock 70kD protein 8; Heat 
shock cognate protein 70 


gi|22042705|re£|XP_171283.1|[22042705] 


heat shock protein 


gi|20542648|reflXP 095553 .4|[20542648] 


Heat shock cognate 71 kDa protein 


gi!20535 1 17|reflXP 06561 1 .2|[20535 117] 
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Protein Name 




7R VPip {jliipn^ft-rpoiilfltpH nrnfpin 

precursor (GRP 78) 

(tnununoglobulin 

heavy chain binding protein) (BIP) 

(Endoplasmic reticulum lumenal 

Ca2+ binding 

protein grp78) 


mll8S46205lreflXP 088941 liri85462051 


glycogenin glucosyltransferase (EC 
2.4.1.186) -human 


gi| 17437880|reflXP_060491.1 1[17437880] 


BAG-family molecular chaperone 


gi|12643895|sp|Q9UL15|BAG5_HUMAN[12643895] 


BAG-family molecular chaperone 
regulator-3 (BCL-2 binding 

(Bcl-2-binding protein Bis) 
(Docking protein CAIR-1) 


gi|12643665|sp|095817|BAG3_HUMAN[12643665] 


R A fr-fiirmlv mnlppnlar rliflnprnnft 

regulator-4 (Silencer of death 

U-UiJJ.cUJ.lo J 


eill2643661lsol095429IBAG4 HUMANri26436611 


Telomerase-binding protein p23 

fi-TcnQO rn-pVianernnpA fPropesterone 

receptor 
complex p23) 


gi|8928247|sp|Q15 1 85|TEBP_HUMAN[8928247] 


TTeat ^hnck-related 70 kDa nrotein 2 
(Heat shock 70 kDa nrotein 2) 


rill708307lsolP54652lHS72 HUMANri7083071 


Heat shock 70 kDa protein 1 
(HSP70.1) (HSP70-1/HSP70-2) 


gi|462325|sp[P08107|HS71_HUMAN[462325] 


stress 70 protein chaperone, 
microsome-associated, 60kD human 
homolog 


gi|9507149|reflNP_062144.1|[9507149] 


rmrOpntilinQTTiiTi/TiiiplpnTilaQTTiiTi "\ 

LLU 1 ^ l CL IvJU 1 11 " UUv iciut iifl^i 1 1 1 1 1 j 


eil6857818lreflNP 008924 lir68578181 


TTpat cVinpk Tvrnfpm 7S kna 

mitochondrial precursor (HSP 75) 
(Tumor necrosis 
factor type 1 receptor associated 
protein) (TRAP-1) (TNFR- 

AQcnniated nrotein 1^ 


1 1 7865679lsnl01 293 1 ITRAL HUMANfl 78656791 


"NTiiplpnnlasTniTi *3 


eil6647592lsDlO75607INPM3 HUMANr66475921 


BAG-family molecular chaperone 
regulator- 1 (BCL-2 binding 
athanogene-1) (BAG-1) 
(Glucocorticoid receptor-associated 
protein RAP46) 


gi| 12644477|sp|Q99933 |BAG1_HUMAN[ 12644477] 


importin4 


gi|18874099|reffNP 078934.2|[18874099] 


importin4 


gi| 1 8700635 |gb|AAL78660. 1|AF41 1 122 lfl 8700635] 
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Protein Name 




Calnexin precursor (Major 

histocompatibility complex class I 

antigen-binding 

protein P88) (P90) (IP90) 


gi|543920|sp|P27824|CALX_HUMAN[543920] 


kelch-like 4, isoform 1 


gi|170l7982|ref|NP 06 1990.2|[ 170 17982] 


kelch-like 4, isofonn 2 


gi| 17017980|refINP 476503.1 1[1 701 7980] 


CDC42-binding protein kinase beta; 
CDC42-binding protein kinase beta 
pMPK-like); MRCK beta; DMPK- 
like 


gi| 1 6357474|ref|NP_006026.2|[ 1 6357474] 


heat shock factor binding protein 1 


gi|4557647|ref|NP 001528.1|[4557647] 


KIAA1052 protein 


gi|7662464|reflNP 05577 1.1|[7662464] 


heat shock 105kD; heat shock 
105kD alpha; heat shock 105kD beta 


gi|5729879|reflNP_006635. 1 1[5729879] 


nucleophosmin/nucleoplasmin 3 


gi| 1 61 1 8247|gb|AAL 1 2 1 72. 1 ][ 1 6 1 1 8247] 


crystallin, alpha A; human alphaA- 
crystallin; crystallin, alpha-1 


gi|4503055|rei]NP_000385.1|[4503055] 


crystallin, alpha B; crystallin, alpha- 
2; Rosenthal fiber component; heat- 
shock 

20 kD like-protein 


gi|4503057|reflNP 001876.1|[4503057] 


calnexin - human 


gi|2134858|pir||I53260r21348581 


calnexin 


gi| 1 07 1 6563 IreflNP 00 1 737. 1 1[ 1 07 1 6563] 


heat shock protein 75 


gi|2865466|gb|AAC02679.1|[28654661 


calnexin 


gi| 1 79832|gb| AAA2 1 749. 1 |fl 798321 



[0034] Preferably, such a chaperone is a protein that is highly expressed and present in the 
target cell population being assayed. Chaperones that are highly expressed permit the 
screening of candidate cyclic peptides without disrupting or interfering with the other 
5 biological activities of the particular chaperone, as the cyclic peptide will bind only a small 
fraction of the chaperone molecules present in the cell. 



[0035] The chaperone may be ubiquitous or non-ubiquitous. Chaperones that are non- 
ubiquitous may provide advantages in identifying therapeutic cyclic peptides having activities 
target to specific cell types. Upon administration of the cyclic peptide to a human or animal, 
10 active complexes will form only in those cell types expressing the specific chaperone bound 
by the cyclic peptide. Cells that do not express the chaperone will be unaffected by the 
presence of the cyclic peptide. 

[0036] One class of highly expressed proteins that are ideal for use as chaperones in the 
present methods is the immunophilins. As discussed in the Background section, 
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immunophilins, which include FK binding proteins and cyclophilins, are known to bind 
immunosuppressant cyclic peptides or macrocycles, such as FK506, Rapamycin ("Rap") or 
cyclosporin A ("CsA'*). Such immunophilin-drug binary complexes inhibit or interfere with 
signal transduction cascades by acting on specific effector molecules. For example, both 
5 CsA-CypA and FK506-FKBP complexes inhibit transcription of lymphokine genes and 
consequent activation of T-cells. The target of these complexes has been identified as the 
Ca 2+ /calmodulin dependent serine/threonine phosphatase calcincurin (Liu et al, 1991, Cell 
66:807; liu et al, 1992, Biochemistry 31:3896; Flanagan et al, 1992, Nature 352:803; 
McCaffrey et al, 1993, J. Biol Chem. 268:3747; McCaffrey 3t al., 1993, Science 262:750). 

10 Likewise, Rap-FKBP complexes interfere with mitogen-induced T-cell activation, B-cell 
proUferation and proliferation induced by several cytokines, including IL-2, EL-3, IL-4 and 
IL-6 (Seghgal et al, 1994, Med. Res. Rev. 14:1-22); interfere with a calcium independent 
signaling cascade in T-cells and mast cells (Schreiber et al, 1992, Tetrahedron 48:2545- 
2558); inhibit cytokine-induced activation of p70 56 kinase (Calvo et al, 1992, Proc. Natl. 

15 Acad. Sci. USA 89:7571-7575; Chung et al, 1992, Cell 69:1227-1236; Kuo et al, 1992, 
Nature 358:70-73; Price etal, 1992, Science 257:973-977); and inhibit activation of 
cdk2/cyclin E complex (Flanagan et al, 1993, J. Cell Biochem. 17A:292). The specificity of 
the immunophilin-drug complexes is presumed to be due to the bound drug. In particular, as 
exemplified by the above discussion, while the FK binding protein FKBP12 (described in 

20 more detail, below) binds both FK506 and Rap to form biologically active complexes, the 
respective complexes have different targets. As discussed in the Background Section, 
structure similarities between the regions of IP3R, FK506 and Rap that bind FKBP12 tend to 
confirm this theory. 

[0037] Moreover, the immunophilins are highly expressed proteins. The activities of FK506, 
25 Rap and CsA are known to require only a small fraction of their respective available 
immunophilins. For example, in the case of FK506 and Rap, it is estimated that these 
compounds effect their biological activities by binding only l/300th of the expressed FK 
binding protein. Thus, the immunophilins make ideal chaperones to identify cyclic peptides 
intended for therapeutic applications. When administered to humans or animals, such 
30 identified cyclic peptides need only bind a small fraction of the available immunophilin, 
leaving the remainder of the immunophilin to carry out its other biological functions. 
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[0038] Numerous inununophilins that can be used as chaperones in accordance with the 
principles of the invention are known in the art. The family of currently known FK560 
binding proteins (FKBPs), which is characterized by FKBP12 (see, Harding et al, 1989, 
Nature 341:758-760; Siekierka et aL 9 1989, Nature 341:755-757; European Patent 
5 Application No. 0 379 342, the latter two of which set forth the nucleotide and deduced 
amino acid sequence of cDNA encoding FKBP12) include FKBP13 {see, e.g., U.S. Patent 
No. 5,498,597), FKBP12.6, FKBP25, FKBPr38, FKBP51 and FKBP52. These various 
FKBPs, as well as their various biological activities and targets, are reviewed in Annistead & 
Harding, 1993, Ann. Rep. Med. Chem. 28:207-215. Any of these FKBPs, or later- 
10 discovered FKBPs, may be used as a chaperone in connection with the methods of the 
invention. 

[0039] The family of currently-known cyclophilins include CypA, CypB, CypC and CypD. 
The cyclophilins are reviewed in, e.g., Bukrinsky, 2002, Trends Immunol. 23(7):323-325 and 
Andreeva et al y 1999, hit. J. Exp. Pathol. 80(6):305-315. Any of these cyclophilins, or later- 
1 5 discovered cyclophilins, may be used as a chaperone in connection with the methods of the 
invention. 

[00401 Referring to FIG. 1, at a minimum, candidate cyclic peptides comprise two regions: a 
chaperone binding region 112 and a putative target binding region 114. The sequence of the 
chaperone binding region will depend upon the specific chaperone, and will be apparent to 

20 those of skill in the art. For example, the sequence of the chaperone binding region may be 
designed to correspond to a peptide or region of a polypeptide known to bind the specific 
chaperone selected. Alternatively, the sequence may be designed to correspond to conserved 
or consensus sequences known (or suspected) to bind the specific chaperone selected. As 
another example, if the three dimensional structure of the chaperone is available, a chaperone 

25 binding region can be designed in silico based upon available structural information. 

[0041J Alternatively, the chaperone binding sequence may be determined from binding 
experiments carried out with the chaperone. In one embodiment, the chaperone is contacted 
with a library of peptides to identify those peptides of the library that bind the chaperone. 
The sequence of the chaperone binding region of the candidate cyclic peptides can 
30 correspond to the sequence of an identified peptide. If there are known ligands for the 

chaperone, the binding experiments may be carried out in a competitive fashion to identify 
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those peptides that competitively bind the chaperone. For example, peptides that bind an 
FKBP in approximately the same region as FK506 can be identified in competitive binding 
experiments with an FKBP-FK506 complex. Peptides that bind a cyclophilin at 
approximately the same region as CsA may be identified in competitive binding experiments 
5 with a CypA-CsA complex. Myriad techniques and methods for screening libraries of 
peptides for the ability to bind a specific target protein of interest and for carrying out 
competitive binding assays that can be used to identify chaperone binding sequences suitable 
for use in the method of the invention are known in the art. Any of these techniques may be 
employed to identify and/or design chaperone binding sequences. 

10 [0042] One art-known approach uses recombinant bacteriophage to produce large libraries of 
peptides which can then be screened in a variety of formats for binding to a specific 
chaperone of interest. Methods for constructing and screening such "phage display" libraries 
are described, for example, in Scott & Smith, 1990, Science 249:386-390; Cwirla et al> 1990, 
Proc. Natl. Acad. Sci. 87:6378-6382; 1990); Devlin etal, 1990, Science 249:404-406 

15 (1990); U.S. Patent No. 5,427,908; U.S. Patent No. 5,432,018; U.S. Patent No. 5,580,717 and 
U.S. Patent No. 5,723,286, the disclosures of which are incorporated herein by reference. 
Other non-limiting examples of recombinant library methodologies that may be used to 
identify peptides that bind a chaperone of interest are described in U.S. Patent No. 6,156,571; 
U.S. Patent No. 6,107,059 and U.S. Patent No. 5,733,731, the disclosures of which are 

20 incorporated herein by reference. 

[0043] A second art-known approach uses chemical methods to synthesize libraries of 
compounds, such as small organic compounds, peptides and/or peptide analogs, attached to 
beads or wafers that can then be conveniently screened for binding with a chaperone of 
interest. The libraries may be encoded or non-encoded. Methods of synthesizing such 

25 immobilized libraries, as well as methods of screening the libraries are described, for 

example, in Houghten, 1985, Proc. Natl. Acad. Sci. USA 82:5131-5735; Geysen et ai 9 
1986, Molecular Immunology 23:709-715; Geysen et al, 1987, J. Immunologic Method 
102:259-274; Frank & Doring, 1988, Tetrahedron 44:6031-6040; Fodor et aL, 1991, Science 
251:767-773; Furka etal, 1988, 4th International Congress of Biochemistry, Volume 5, 

30 Abstract FR:013; Furka, 1991, Int J. Peptide Protein Res. 37:487-493; Frank, 1992, 
Tetrahedron 48:9217-9232; Needels etal, 1993, Proc. Natl. Acad. Sci. USA 
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90:10700-10704; DeWitt etal 9 1993, Proc. Natl. Acad. Sci. USA 90:6909-6913; Frank et 

a/., 1993,Biorg. Med Chem. Lett. 3:425-430; Ohlmeyere/ a/., 1993, Proc. Natl. Acad. 

Sci. USA 90:10922-10926; WO 92/00252; WO 9428028; U.S. Patent No. 6,329,143; U.S. 

Patent No. 6291,183; U.S. Patent No. 5,885,837; U.S. Patent No. 5,424,186; U.S. Patent No. 
5 5,384,261; U.S. Patent No. 6,165,717; U.S. Patent No. 6,143,497; U.S. Patent No. 6,140,493; 

U.S. Patent No. 5,789,162; U.S. Patent No. 5,770,358; U.S. Patent No. 5,708,153; U.S. Patent 

No. 5,639,603; U.S. Patent No. 5,541,061; U.S. Patent No. 5,525,735; U.S. Patent No. 

5,525,734; U.S. Patent No. 6,261,776; U.S. Patent No. 6,239,273; U.S. Patent No. 5,846,839; 

U.S. Patent No. 5,770,455; U.S. Patent No. 5,770,157; U.S. Patent No. 5,609,826; U.S. Patent 
10 No. 6,001,579; U.S. Patent No. 5,968,736; U.S. Patent No. 5,962,337; U.S. Patent No. 

5,789,172; U.S. Patent No. 5,721,099; U.S. Patent No. 5,565,324; U.S. Patent No. 5,010,175; 

and U.S. Patent No. 4,631,21 1, the disclosures of which are incorporated herein by reference. 

For reviews of some of these techniques, see Ellman et al % 1996, Account, Chem. Res. 

29:132-143; Gallop et aL y 1994, J. Med. Chem. 37:1233-1251; Gordon^/., 1994, J. 
15 Med. Chem. 37:1385-1401. Additional review articles, references, patents and books 

describing myriad techniques for synthesizing and screening libraries of peptides for the 

ability to bind a chaperone of interest can be found at Lebl & Leblova: Dynamic Database of 

References in Molecular Diversity, Internet http://www.5z.com (see especially the diversity 

information pages at http://www.5z.com/divinfo) . 

20 [0044] In an alternative embodiment, sequences suitable for use in the chaperone binding 
• region can be identified using the power of the immune system. For example, antibodies 
specific for the chaperone can be generated using standard techniques (described in more 
detail, below) and the sequence(s) of the antigenic regions or portions determined. These 
sequences can then be used to define the sequence of the chaperone binding region of 

25 candidate cyclic peptides. Skilled artisans will recognize that myriad techniques for 

generating antibodies and/or fragments of antibodies that specifically bind molecules are 
well-known in the art. Myriad library techniques for identifying antibodies, antibody 
fragments and/or single-chain variants thereof are also known in the art. Any of these 
methods may be used to identify sequences useful for the chaperone binding region of the 

30 candidate cyclic peptides. Some of these various immunological methods are described in 
more detail, below. 
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[0045] In a preferred embodiment, the chaperone binding region includes a sequence capable 
of binding an immunophilin. A specific sequence capable of binding the immunophilin 
CypA that may be used in connection with the invention is Ala-Gly-Pro-De. A specific 
sequence capable of binding the immu nophilin FKBP12 that may be used in connection with 
5 the invention is Leu-Pro. 

[0046] Referring to FIG. 1, the candidate cyclic peptides also include a putatitive target 
binding region. The putative target binding region constitutes a Variable" region of the 
candidate cyclic peptide. The sequence of this region can be completely random. For 
example, the sequence of the putative target binding region may include a random peptide 

10 sequence composed of 2, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, or even 
more amino acid residues. Alternatively, the sequence can include one or more variable or 
random regions flanked by one or more regions of defined sequence(s). As a specific 
example, the sequence of the putative binding region may include a region of defined 
sequence disposed between two variable regions, or vice versa. The region(s) of defined 

1 5 sequence(s) can be selected to reduce the complexity of the library or for other reasons, such 
as, for example, to perform specified functions. For example, the defined sequences can be 
selected so as to restrict conformationally any variable or random sequences fused thereto. A 
variety of different defined sequences that can be included in the target binding region of the 
candidate cyclic peptides are described in U.S. Patent No. 6,153,380 (see especially Cols. 4- 

20 12), the disclosure of which is incorporated herein by reference. These sequences can be 
used alone or in various combinations. The regions of defined sequence(s) can be fused 
directly to the random or variable regions, or they may be spaced away from such regions via 
one or more of the same or different linkers (described in more detail, below). 

[0047] In a preferred embodiment, the putative binding region is a random sequence ranging 
25 from 2 to 20 amino acids in length, more preferably from 2 to 8 amino acids in length. 

[0048] As illustrated in FIG. 1, the chaperone binding and putative target binding regions are 
fused together, optionally by way of linkers 116. When linkers are used, they may be the 
same or different, and may vary widely in properties. For example, the linkers may be 
flexible or rigid, hydrophobic or hydrophilic or long or short, depending upon a particular 
30 application. Preferably, any optional linkers used will be flexible. Suitable flexible linkers 
include polyglycine, glycine-serine polymers, including, for example, (GS) n , (GSGGS) n and 
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(GGGS)n where n is an integer of at least one, typically from 1 to 2; glycine-alanine 
polymers, alanine-serine polymers, and other flexible linkers such as the tether for the shaker 
potassium channel, and a large variety of other flexible linkers, as will be appreciated by 
those in the art. 

5 [0049] The candidate cyclic peptides may be synthesized chemically and administered to the 
cells by contacting the cells with the candidate cyclic peptide. Myriad techniques for 
synthesizing cyclic peptides are well-known {see, e.g., Tarn et al, 2000, Biopolymers 
52:311-332; Camameroetf al, 1998, Angew. Chem. Intl. Ed. 37: 347-349; Tarn etal, 1998, 
Prot. Sci. 7:1583-1592; Jackson et al, 1995, J. Am. Chem. Soc. 117:819-820; Dong et 

10 al, 1995, J. Am. Chem. Soc. 1 17:2726-2731; Ishida et al, 1995, J. Org. Chem. 60:5374- 
5375; WO 95/33765, published Jun. 6, 1995; Xue and DeGrado, 1994, J. Org. Chem. 
60(4):946-952; Jacquier et al, 1991, In: Peptides 1990 221-222, Giralt and Andreu, Eds., 
ESCOM Leiden, The Netherlands; Schmidt andNeubert, 1991, In: Peptides 1990 214-215, 
Giralt and Andreu, Eds., ESCOM Leiden, The Netherlands; Toniolo, 1990, Int. J. Peptide 

15 Protein Res. 35:287-300; Ulysse et al, 1995, J. Am. Chem. Soc. 1 17:8466-8467; Durref 
al, 1991, Peptides 1990 216-218, Giralt and Andreu, Eds., ESCOM Leiden, The 
Netherlands; Lender et al, 1993, Mt. J. Peptide Protein Res. 42:509-517; Boger and 
Yohannes, 1990, J. Org. Chem. 55:6000-6017; Brady et al, 1979, J. Org. Chem. 
4(18):3101-3105; Spatolaef a/., 1986, J. Am. Chem. Soc. 108:825-831; Seidell al, 1991, 

20 In: Peptides 1990 236-237, Giralt and Andreu, Eds., ESCOM Leiden, The Netherlands; 
Tanizawaef a/., 1986, Chem. Phar, Bull. 34(10):4001-4011; Goldenburg & Creighton, 
1983, J. Mol. Biol. 165:407-413). The cyclic peptides can be synthesized on a compound- 
by-compound basis, or, alternatively, the cyclic peptides may be synthesized as libraries by 
routine adaptation any of the myriad known methodologies for synthesizing libraries of 

25 compounds. Addition methods that may routinely adapted to the synthesis of cyclic peptide 
libraries are described in the art (see, e.g., Bunin, 1998, The Combinatorial Index, Academic 
Press, San Diego, CA, and the various references cited therein). 

[0050] Preferably, however, the candidate cyclic peptides are administered to the cells by 
way of candidate polynucleotides capable of expressing the cyclic peptides inside the cell. A 
30 variety of polynucleotides capable of expressing cyclic peptides in cells that utilize the trans 
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splicing ability of inteins have been developed. Such polynucleotides can be used to 
administer candidate cyclic peptides to cells in accordance with the method of the invention. 

[0051] Inteins are protein splicing elements that mediate their excision from precursor 
proteins and the joining of the flanking proteins (N-extein and C-extein) to yield two mature 
5 proteins: the intein and the ligated protein (Perler et aL, 1994, Nucl. Acids Res. 22:1 125- 
1 127). The bond formed between the ligated exteins is a native peptide bond (Perler et aL, 
1997, Curr. Opin. Chem. Biol. 1:292-299). The self-catalytic splicing reaction requires 
four nucleophilic displacements mediated by three conserved splice junction residues: (i) a 
Ser, Cys or Ala at the intein N- terminus; (ii) an Asn or Asp at the intein C- terminus; and (iii) 

10 a Ser, Thr or Cys at the beginning of the C-extein (Xu et aL, 1993, Cell 75:1371-1377; Xu et 
aL, 1994, EMBO J. 13:5517-5522; Shao et aL 1995, Biochemistry 34:10844-10850; Chong et 
aL, 1996, J. Biol. Chem. 271:22159-22168; Xu& Perler, 1996, EMBO J. 15:5146-5153). 
An example of an intein-mediated protein splicing reaction is illustrated in FIG. 2. Referring 
to FIG. 2, precursor protein 2, which comprises an intein 4 flanked by an N-terminal extein 6 

15 and a C-terminal extein 8, undergoes a protein splicing reaction, which is mediated or 
catalyzed by intein 4, to yield the excised intein 12 and fusion protein 10. As illustrated, 
intein 4 comprises an N-terminal protein splicing domain 5 (In) and a C-terminal protein 
splicing domain 7 (Ic) flanking an endonuclease domain 9 (EN). Fusion protein 10 
comprises N-terminal extein 6 fused to C-tenninal extein 8 via a native peptide bond 3. 

20 [0052] The mechanism of action of the intein-mediated protein splicing reaction is illustrated 
in FIG. 3. Referring to FIG. 3, which illustrates the precursor protein 2 of FIG. 2 showing a 
required Ser 20 at the intein N-terminus, the required Asn 22 at the intein C-terminus, and a 
required Ser 24 at the N-terminus (beginning) of the C-terminal extein 8, the splicing reaction 
proceeds via four steps: (1) Step 1 is an N-0 acyl shift that occurs between the C-tenninal 

25 amino acid of N-terminal extein 6 and the N-terminal Ser 20 of intein 4, resulting in the 

formation of a reactive ester 30; (2) in Step 2, the ester 30 is the focus of a transesterfication 
reaction that results in a branched intermediate 32 in which N-terminal extein 6 is attached 
via an ester to the N-terminal Ser 24 of C-terminal extein 8; (iii) in Step 3, branched 
intermediate 32 is resolved by the cyclization of the intein C-tenninal Asn residue 22 to form 

30 a succinimide group 26, the excised intein 12 is released from precursor 2 and N-terminal 
extein 6 and C-terminal extein 8 are joined by an ester bond 28; (iv) in Step 4, a spontaneous 
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ON acyl shift generates a native peptide bond 3 between exteins 6 and 8 at the condensation 
point to yield fusion protein 10. 

[0053] Split inteins, whether naturally occurring or artificially created by splitting a 
contiguous intein into two distinct "halves," are capable of catalyzing a trans ligation reaction 
5 that yields an extein product cyclized in a head-to-tail fashion {see, e.g., Southworth et aL, 
1998,EMBOJ. 17:918-926; Xuet aL, 1999, Proc. Natl. Acad. Sci. USA 95:6705-6710; 
Evans et aL, 1999, Biochemistry 274:18359-18363; Scott et aL, 1999, Proc. Natl. Acad. 
Sci. USA 96:13638-13643). The splitting of an intein to catalyze a trans ligation reaction 
generally requires segregating the In and Ic protein splicing domains and reversing their 
10 translational order such that the splicing and ligation reaction fuses the former N- and 

C-termini. This process is well-understood and is described, for example, in WO 01/66565 
(see, e.g., FIGS. 1A, IB, 2A and 2B and the text associated therewith). 

[0054] An example of such a split intein construct and the cyclization reaction catalyzed 
thereby is illustrated in FIG. 4. Referring to FIG. 4, precursor protein 50 comprises an extein 
15 52 interposed between two intein protein splicing domains 54, 56. The upstream intein 
domain 54 corresponds to the C-tenninal (Ic) domain 7 of FIG. 2. The downstream intein 
domain 56 corresponds to the N-terminal (I N ) domain 5 of FIG. 2. 

[0055] The intein illustrated in FIG. 2 is one of many inteins that is bifunctional in that it 
mediates both protein splicing and DNA cleavage. This latter activity is mediated by the 

20 endonuclease domain (EN) 9 that interrupts the N- and C-terminal intein protein splicing 
domains 5, 7 illustrated in FIG. 2. Because the endonuclease activity is not required for 
protein splicing, split intein constructs capable of expressing cyclic peptides that are designed 
from such bifunctional inteins need not include the endonuclease domain (see, e.g, Wood et 
aL, 1999, Nature Biotechnology 17:889-892). Thus, the precursor protein 50 illustrated in 

25 FIG. 4 does not include an endonuclease domain. 

[0056] Referring again to FIG. 4, when such a precursor protein 50 is expressed in an 
appropriate host system, Ic 54 and I N 56 physically come together to form an active intein 
(illustrated in FIG. 5) that catalyzes a protein splicing reaction that yields an extein cyclized 
in a head-to-tail fashion 58 and released C- and N-terminal intein domains 60 and 62. Like 
30 the splicing reaction catalyzed by the contiguous intein illustrated in FIGS. 2 and 3, the 
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splicing reaction catalyzed by the split intein construct 50 of FIG. 4 also requires four 
nucleophilic displacements mediated by three conserved splice junction residues: (i) a Ser, 
Cys or Ala at the N-terminus of In 56; (ii) an Asn or Asp at the C-tenninus of Ic 54; and (iii) 
a Ser, Cys or Thr at the N-tenninus (beginning) of extein 52 

5 [0057] The mechanism of the trans splicing reaction is illustrated in FIG. 5. In FIG. 5, the C- 
terminal Asn residue of Ic 54, the N-terminal Ser residue of extein 52 and the N-tenninal Ser 
residue of In 56 are illustrated. In Step 1, Ic 54 and In 56 come together physically to yield an 
active intein 64 having a conformation that forces extein 52 into a loop configuration. In 
Step 2, the N-terminal Ser of In 56 undergoes an O-C acyl migration to yield a reactive ester 
10 66. In Step 3, the oxygen of the N-terminal Ser residue of extein 52 reacts with the ester to 
yield a lariat product 68 and released In 62. The active intein then resolves the lariat 68 via 
the formation of a succinimide that liberates a cyclized lactone form of the extein 70 and Ic 
60. The lactone 70 then spontaneously rearranges to form the thermodynamically favored 
head-to-tail lactone form of the cyclic peptide 58. 

15 [0058] Candidate polynucleotides useful in the methods of the invention exploit this trans 
splicing ability of split inteins to express candidate cyclic peptides in cells. Thus, the 
candidate polynucleotides generally comprise a first segment encoding a C-terminal intein 
protein splicing domain (Ic), a second segment encoding a linear version of a candidate cyclic 
peptide and a third segment encoding an N-tenninal intein protein splicing domain (In). The 

20 three segments are arranged such that when expressed, the polynucleotide yields the 
precursor protein 50 (FIG. 4) in which extein 52 corresponds to the linear version of a 
candidate cyclic peptide. Precursor protein 50 then undergoes spontaneous splicing to yield 
the candidate cyclic peptide (element 58 of FIG. 4). 

[0059] Nucleotide sequences that encode Ic and I N may be derived from naturaUy-occurring 
25 split inteins (i.e. 9 inteins that in nature are produced as two distinct polypeptide chains) or 
from contiguous inteins that have been artificially split and rearranged using known 
techniques. One example of a naturally-occurring split intein that may be used in connection 
with the polypeptides of the invention is SspDnaE(Wu et aL 9 1998, Proc. Natl. Acad. Sci. 
USA 95:9226). Another specific example is Ssp DnaB (Wu et aL, 1998, Biochem Biophys 
30 Acta 1387:422-432; Evans etal, 1999, J. Biol. Chem. 274:18359-63). A fairly 

comprehensive list of contiguous inteins that may be artificially split and rearranged to 
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express candidate cyclic peptides according to the invention is published by New England 
Biolabs at http:/Avww.nebxom/inteins/int_req.html. Guidance for splitting and rearranging 
such inteins for use in the candidate polynucleotides of the invention may be found, for 
example, in WO 01/66565; Evans et aL, 1999, J. Biol. Chem. 274:18359; Mills et aL, 1998, 
5 Proc. Natl. Acad. Sci. USA 95:3543. Specific examples of contiguous inteins that may be 
split and rearranged in accordance with the invention include Psp-Pol-1 (Southworth et aL, 
1998, EMBO J. 17:918), Mycobacterium tuberculosis RccA intein (Lew et aL, 1998, J. Biol 
Chem. 273:15887; Shingledeckere/ a/., 1998, Gene 207:187; Mills etaL, 1998, Proc. Natl. 
Acad. Sci. USA 95:3543), Ssp/DnaB/MxeGyrA (Evans et aL, 1999, J. Biol. Chem. 
10 274:18359) and Pfu (Otoma et aL, 1999, Biochemistry 38:16040; Yamazaki et aL, 1998, J. 
Am. Chem. Soc. 120:5591). Additional specific examples are found in WO 01/66565. All 
of these various references and the inteins described therein are incorporated herein by 
reference. 

[0060] According to the methods of the invention, the candidate polynucleotides are 
15 introduced into cells to screen for and identify expressed candidate cyclic peptides that alter a 
phenotype of the cells. By "introduced into" or grammatical equivalents thereof is meant that 
the candidate polynucleotides enter the cells in a manner suitable for subsequent expression 
of the candidate cyclic peptides. The method of introduction is largely dictated by the 
targeted cell type, discussed in more detail below. Exemplary methods include CaP04 
20 precipitation, liposome fusion, lipofectin®, electroporation, viral infection, etc. The 

candidate polynucleotides may stably integrate into the genome of the host cell (for example, 
with retroviral introduction, outlined below), or may exist either transiently or stably in the 
cytoplasm (le., through the use of traditional plasmids, utilizing standard regulatory 
sequences, selection markers, etc.). As many pharmaceutically important screens require 
25 human or model (e.g., murine, simian, etc.) mammalian target cells the use of retroviral 
vectors capable of transfecting such targets are preferred. 

[0061] In one embodiment, the candidate polynucleotides are part of a retroviral particle 
which infects the cells. Generally, infection of the cells is straightforward with the 
application of the infection-enhancing reagent polybrene, which is apolycation that facilitates 
30 viral binding to the target cell. Infection can be optimized such that each cell generally 
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expresses a single construct, using the ratio of virus particles to number of cells. Infection 
follows a Poisson distribution. 

[0062] In another embodiment, the candidate polynucleotides are introduced into the cells 
using the well-known retroviral vectors and infection techniques pioneered by Richard 
5 Mulligan and David Baltimore with Psi-2 lines and analogous retroviral packaging systems 
based upon NIH 3T3 cells (see Mann et al, 1993, Cell 33:153-159, the disclosure of which is 
incorporated herein by reference). Such helper-defective packaging cell lines are capable of 
producing all of the necessary trans proteins (gag, pol and env) required for packaging, 
processing, reverse transcribing and integrating genomes. Those RNA molecules that have in 

10 cis the $ packaging signal are packaged into maturing retrovirions. Virtually any of the art- 
known retroviral vectors and/or transfection systems may be used. Specific non-limiting 
examples of suitable transfection systems include those described in; WO 01/66565, WO 
00/36093; WO 97/27213; WO 97/27212; Choate et al, 1996, Human Gene Therapy 7:2247- 
2253; Kinsella et al, 1996, Human Gene Therapy 7:1405-1413; Hofinann et al, 1996, Proc. 

15 Natl. Acac. Sci. USA 93:5185-5190; Kitamura et al, 1995, Proc. Natl. Acac. Sci. USA 
92:9146-9150; WO 94/19478; Pear et al, 1993, Proc. Natl. Acac. Sci. USA 90:8392-8396; 
Mann et al, 1993, Cell 33:153-159 and the references cited in all of the above, the 
disclosures of which are incorporated herein by reference. 

[0063] In one embodiment of the invention, a library of candidate polynucleotides is 
20 generated in a retrovirus DNA construct backbone, as is generally described in U.S. Patent 
No. 6,153,380 (see, e.g., Cols. 16-17) and WO 01/66565, the disclosures of which are 
incorporated herein by reference. Any number of suitable retroviral vectors may be used. 
Generally, the retroviral vectors may include: selectable marker genes under the control of 
internal ribosome entry sites (IRES), which allows for bioistronic operons and thus greatly 
25 facilitates the selection of cells expressing candidate cyclic peptides at uniformly high levels; 
and promoters driving expression of a second gene, placed in sense or anti-sense relative to 
the 5 ! LTR. Suitable selection genes include, but are not limited to, neomycin, blastocidin, 
bleomycin, puromycin, and hygromycin resistance genes, as well as self- fluorescent markers 
such as green or blue fluorescent protein, enzymatic markers such as lacZ, and surface 
30 proteins such as CD8, etc. Specific non-limiting examples of suitable retroviral vector 

systems include vectors based upon murine stem cell virus (MSCV) as described in Hawley 
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et al, 1994, Gene Therapy 1:136; vectors based upon a modified MGF virus as described in 
Rivere et al, 1995, Genetics 92:6733; pBABE as described in WO 97/27213 and WO 
97/27212; the vectors depicted in FIG. 1 1 of WO 01/34806 and the vectors described in WO 
01/66565 and WO 00/36093, the disclosures of which are incorporated herein by reference. 

5 [0064] A specific example of a retroviral vector useful for creating a candidate 

polynucleotide library and expressed candidate cyclic peptide library is illustrated in FIG. 6. 
Referring to FIG. 6, the construct includes retroviral mutated non-functional LTR promoters 
("SIN") flanking an inverted intein (fc and In). The illustrated amino acids of the intein are 
based upon the sequence of the inverted DnaBintein (Scott et al, 2001, Chem. Biol. 8:801- 

10 815). The Ic and In domains of the inverted intein flank the region encoding the cyclic 

peptide (library insert; extein). As discussed previously, inteins require nucleophilic residues 
after each splice junction for splicing activity at the junction (Mathys et al, 1999, Gene 
231:1-13) and have an invariant Asn residue at the C-tenninus of Ic. In FIG. 6, the invariant 
Ser residue is boxed, and the required nucleophilic residues Cys, Asn and His of I N and Ic are 

15 underlined. The candidate cyclic peptide 100 expressed by this construct is illustrated above 
the vector. Also illustrated are an optional tetracycline-regulated transactivator (tTA) useful 
for regulating the expression of the cyclic peptide with tetracycline or doxycycline (discussed 
in more detail below) and an optional blue fluorescent protein (BFP) useful for monitoring 
introduction of the polynucleotides (and subsequent expression thereof) into cells. 

20 [0065] A specific example of a nucleotide sequence encoding the Ic-extein-lN region of the 
construct of FIG. 6 is provided in FIG. 7. In FIG. 7, nucleotides #1-156 correspond to the Ic 
region of FIG. 6, nucleotides #157-171 correspond to the cyclic peptide (extein nucleotides 
#147-149 encode a fixed Ser residue) region of FIG. 6 and nucleotides #172-489 correspond 
to the Ic region of FIG. 6. Skilled artisans will recognize that in the candidate cyclic peptide 

25 (extein) region of FIG. 7, "N" represents nucleotides encoding the candidate cyclic peptides 
(an AGC codon encoding a Ser residue necessary for splicing is illustrated). The translated 
amino acids are shown below their respective codons. Stop codons are indicated with a 
period ("."). Nucleotides #51 1-1227 encode BFP. 

[0066] Another specific example of a retroviral vector that may be routinely adapted to create 
30 a candidate polynucleotide library is illustrated in U.S. Patent No. 6,1 53,380, FIG. 4, the 

disclosure of which is incorporated herein by reference. Skilled artisans will appreciate that a 
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polynucleotide sequence capable of expressing a candidate cyclic peptide, such as the 
polynucleotide sequence of FIG. 7, may be inserted into the illustrated vector at the region 
labeled peptides are inserted here." 

[0067] The retroviral may include inducible and constitutive promoters. For example, there 
5 are situations wherein it may be desirable or necessary to induce cyclic peptide expression 
only during certain phases of the selection process. In such instances, the cyclic peptide 
expression is only turned on during that phase of the selection desired. A large number of 
both inducible and constitutive promoters that may be used in this context are well-known. 

[0068] In addition, it is possible to configure a retroviral vector to allow inducible expression 
10 of retroviral inserts after integration of a single vector in target cells; importantly, the entire 
system is contained within the single retrovirus. Tefc-inducible retroviruses have been 
designed incorporating the Self-inactivating (SIN) feature of 3 f LTR enhancer/promoter 
retroviral deletion mutant (Hoffman et ah, 1996, PNAS USA 93:5185). Expression of this 
vector in cells is virtually undetectable in the presence of tetracycline or other active analogs. 
1 5 However, in the absence of Tet, expression is turned on to maximum within 48 hours after 
induction, with unif orm increased expression of the whole population of cells that harbor the 
inducible retrovirus, indicating that expression is regulated uniformly within the infected cell 
population. A similar, related system uses a mutated Tet DNA-binding domain such that it 
bound DNA in the presence of Tet, and was removed in the absence of Tet. Either of these 
20 systems is suitable for use in connection with the present invention. 

[0069] Delivery of the candidate polynucleotide library into a retroviral packaging system 
results in conversion to infectious virus. Suitable retroviral packaging system cell lines 
include, but are not limited to, the Bing and BOSC23 cell lines described in WO 94/19478; 
Soneoka et c/,,1995, Nucleic Acid Res. 23(4):628; Finer et aL 9 1994, Blood 83:43; Phoenix 

25 packaging lines such as PhiNX-eco and PhiNX-ampho, described in U.S. Patent No. 

6,153,380 (see, e.g., Col 18, line 9 through Col. 19, line 29); 292T+gag-pol and retrovirus 
envelope; PA317; and cell lines outlined in Markowitz et aL 9 1988, Virology 167:400, 
Markowitze/a/., 1988, J. Virol. 62:1120, Li et a/., 1996, PNAS USA 93:11658, Kinsellart 
aL, 1996, Human Gene Therapy 7:1405, WO 01/66565, WO 00/36093, in WO 97/27213 and 

30 WO 97/27212, all of which are incorporated by reference. If necessary, the virus may be 
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concentrated as described in U.S. Patent No. 6,153,380 {see, e.g., Col 19, lines 35-61), the 
disclosure of which is incorporated herein by reference. 

[0070] The candidate polynucleotides, as part of the retroviral construct, are introduced into a 
plurality of cells to screen for candidate cyclic peptides capable of altering a phenotype of a 
5 cell. By a plurality of cells" is meant roughly from about 10 3 cells to 10 8 or 10 9 cells, with 
from about 10 6 to 10 8 cells being typical. This plurality of cells comprises a cellular library, 
wherein generally each cell within the library contains a member of the retroviral construct 
library, i.e. a different candidate polynucleotide, although as will be appreciated by those in 
the art, some cells within the library may not contain a retrovirus, and some may contain 
10 more than one. When methods other than retroviral infection are used to introduce the 
candidate polynucleotides into a plurality of cells, the distribution of candidate 
polynucleotides within the individual cell members of the cellular library may vary widely, as 
it is generally difficult to control the number of nucleic acids which enter a cell during 
electroporation, etc. 

15 [0071] The types of cells used in the present invention can vary widely. Basically, any 
mammalian cells may be used, with mouse, rat, primate and human cells being particularly 
preferred, although as will be appreciated by those in the art, modifications of the system by 
pseudotyping allows all eukaryotic cells to be used, preferably higher eukaryotes. Typically, 
a screen will be set up such that the cells exhibit a selectable phenotype in the presence of a 

20 candidate cyclic peptide. Skilled artisans will be able to select an appropriate or suitable cell 
type depending upon, among other factors, the chaperone targeted by the chaperone binding 
region of the candidate cyclic peptides and the types of biological processes regulated by the 
chaperone. Cell types implicated in a wide variety of disease conditions are particularly 
useful, so long as a suitable screen may be designed to allow the selection of cells that exhibit 

25 an altered phenotype as a consequence of the presence of a candidate cyclic peptide within 
the cell. 

[0072] Accordingly, suitable cell types include, but are not limited to, tumor cells of all types 
(particularly melanoma, myeloid leukemia, carcinomas of the lung, breast, ovaries, colon, 
kidney, prostate, pancreas and testes), cardiomyocytes, endothelial cells, epithelial cells, 
30 lymphocytes (T-cell and B cell) , mast cells, eosinophils, vascular intimal cells, hepatocytes, 
leukocytes including mononuclear leukocytes, stem cells such as haemopoetic, neural, skin, 
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lung, kidney, liver and myocyte stem cells (for use in screening for differentiation and de- 
differentiation factors), osteoclasts, chondrocytes and other connective tissue cells, 
keratinocytes, melanocytes, liver cells, kidney cells, and adipocytes. Suitable cells also 
include known research cells, including, but not limited to, Jurkat T cells, NIH3T3 cells, 
5 CHO cells, Cos cells, etc., and the myriad other cell lines described in the ATCC cell line 
catalog, incorporated by reference. As specific examples, T-cells, B-cells, mast cells and 
other types of immune system cells are suitable for candidate cyclic peptides including 
chaperone binding regions that bind immunophilins. 

[0073] The cells in which the candidate polynucleotides are screened may be optionally 
1 0 genetically engineered to contain exogenous nucleic acids. For example, the cells may be 
engineered to include a reporter gene {e.g., a gene encoding a fluorescent protein) fused 
downstream of a specific responsive element or promoter. The selection of the particular 
responsive element or promoter will depend upon the particular process for which 
modulatory cyclic peptides are desired, and will be apparent to those of skill in the art. 
1 5 Alternatively, or in addition, the cells may be optionally engineered to express or overexpress 
the chaperone. However, in preferred embodiments a cell type is selected that highly 
expresses the chaperone endogenously. 

[0074] If necessary, the cells are treated to conditions suitable for the expression of the 
candidate polynucleotides (for example, when inducible promoters are used), to produce the 
20 candidate cyclic peptides. The plurality of cells is then screened, as is more fully outlined 
below, for a cell exhibiting an altered phenotype. The altered phenotype is due to the 
formation of a ternary complex between a candidate cyclic peptide, the chaperone and a 
target molecule, as illustrated in FIG. 1 . 

[0075] By "altered phenotype" or "changed physiology" or other grammatical equivalents 
25 thereof is meant that a phenotype of the cell is altered in some way, preferably in some 

detectable and/or measurable way. As will be appreciated in the art, a strength of the present 
invention is the wide variety of cell types and potential phenotypic changes which may be 
tested using the present methods. Accordingly, any phenotypic change which maybe 
observed, detected, or measured may be the basis of the screening methods herein. Suitable 
30 phenotypic changes include, but are not limited to: gross physical changes such as changes in 
cell morphology, cell growth, cell viability, adhesion to substrates or other cells, and cellular 
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density; changes in the expression of one or more RNAs, proteins, lipids, hormones, 
cytokines, or other molecules; changes in the equilibrium state (i.e. half-life) or one or more 
RNAs, proteins, lipids, hormones, cytokines, or other molecules; changes in the localization 
of one or more RNAs, proteins, lipids, hormones, cytokines, or other molecules; changes in 
5 the bioactivity or specific activity of one or more RNAs, proteins, lipids, hormones, 
cytokines, receptors, or other molecules; changes in the secretion of ions, cytokines, 
hormones, growth factors, or other molecules; alterations in cellular membrane potentials, 
polarization, integrity or transport; changes in infectivity, susceptibility, latency, adhesion, 
and uptake of viruses and bacterial pathogens; etc. The specific phenotypic change selected 
1 0 for observation will depend upon a variety of factors, and will be apparent to those of skill in 
the art. A variety of specific biological processes and their respective phenotypes for which 
modulatory cyclic peptides may be identified in connection with the methods of the present 
invention are described in, among other references, U.S. Patent No. 6,153,380 {see, e.g., 
Cols. 26-34), the disclosure of which is incorporated herein by reference. 

1 5 [0076] The altered phenotype may be detected in a wide variety of ways, as is described more 
fully below, and will generally depend and correspond to the phenotype that is being altered. 
Generally, the changed phenotype is detected using, for example: microscopic analysis of cell 
morphology; standard cell viability assays, including both increased cell death and increased 
cell viability, for example, cells that are now resistant to cell death via virus, bacteria, or 

20 bacterial or synthetic toxins; standard labeling assays such as fluorometric indicator assays 
for the presence or level of a particular cell or molecule, including FACS or other dye 
staining techniques; biochemical detection of the expression of target compounds after killing 
the cells; etc. In embodiments in which the cells have been engineered to include a reporter 
gene, the change in phenotype can be determined by assessing the level of expression of the 

25 reporter gene, either at the transcriptional level or the translational level. 

[0077] In most cases, the altered phenotype is detected in the cell in which the candidate 
polypeptide was introduced (i.e., the cells are assessed for autocrine effects). However, in 
some instances, such as those instances where the desired phenotypic change involves a 
change in the ability of a cell to signal extracellularly, the altered phenotype is detected in a 
30 second cell, or even a third cell, which is responding to some molecular signal from the first 
cell. In this latter embodiment, the second population of cells is assessed for endocrine or 
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paracrin effects. The first cell which received the candidate polynucleotide may or may not 
be assessed for an altered phenotype. Methods for observing or detecting endocrine or 
paracrin effects are well known. 

[0078] Once a cell with an altered phenotype is detected ("positive cell"), the cell may be 
5 isolated from the plurality which do not have altered phenotypes. This may be done in any 
number of ways, as is known in the art, and will in some instances depend on the assay or 
screen. Suitable isolation techniques include, but are not limited to, FACS, lysis selection 
using complement, cell cloning, scanning by Fluorimager, expression of a "survival" protein, 
induced expression of a cell surface protein or other molecule that can be rendered 
10 fluorescent or taggable for physical isolation; expression of an enzyme that changes a non- 
fluorescent molecule to a fluorescent one; overgrowth against a background of no or slow 
growth; death of cells and isolation of DNA or other cell vitality indicator dyes, etc. 

[0079] The candidate polynucleotide from the positive cells may be sequenced, with or 
without prior isolation, to determine the sequence of the putative target binding region of the 
1 5 expressed candidate cyclic peptide. Alternatively, the candidate cyclic peptide may be 
isolated and characterized using standard analytical techniques (e.g. t mass spectroscopy, 
NMR, etc). 

[0080] The identified cyclic peptide may be synthesized synthetically and introduced into the 
target cells to confirm its activity. Alternatively, the candidate polynucleotide may be 
20 reintroduced into the target cells to confirm the activity of the expressed cyclic peptide. 

[0081] The sequence of the putative target binding region may also be used generate 
additional candidate cyclic peptides having altered activities, as is well-known in the art. 

[0082] A significant advantage of the method of the invention is the ability to isolate from the 
positive cells the target molecule(s) with which the CCP-C complex interacts. Since the 
25 chaperone targeted by the chaperone binding region of the candidate cyclic peptides is know, 
the CCP-C complexes may be isolated from lysates of positive cells using well-known 
affinity techniques. As a specific example, the CCP-C complexes may be isolated from 
positive cell lysates using affinity chromatography with an antibody that specifically binds 
the chaperone. 
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[0083] For the production of antibodies, various host animals, including but not limited to 
rabbits, mice, rats, etc., may be immunized by injection with the chaperone of interest. The 
chaperone may be attached to a suitable carrier, such as BSA, by means of a side chain 
functional group or linkers attached to a side chain functional group. Various adjuvants may 
5 be used to increase the immunological response, depending on the host species, including but 
not limited to Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, 
surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil 
emulsions, keyhole limpet hemocyanin, dinitrophenol, and potentially useful human 
adjuvants such as BCG (bacilli Calmette-Guerin) and Coiynebacterium parvum. 

10 [0084] Monoclonal antibodies to a specific chaperone may be prepared using any technique 
which provides for the production of antibody molecules by continuous cell lines in culture. 
These include, but are not limited to, the hybridoma technique originally described by Kohler 
& Milstein, 1975, Nature 256:495-497 and/or Kaprowski, U.S. Patent No. 4,376,1 10; the 
human B-cell hybridoma technique described by Kosbor et al, 1983, Immunology Today 

15 4:72 and/or Cote et al, 1983, Proc. Natl. Acad. Sci. USA 80:2026-2030; and the 

EBV-hybridoma technique described by Cole et al, 1985, Monoclonal Antibodies and 
Cancer Therapy, Alan R. Liss, Lie, pp. 77-96. In addition, techniques developed for the 
production of "chimeric antibodies" (Morrison et al, 1984, Proc. Natl. Acad. Sci. USA 
81:6851-6855; Neuberger et al, 1984, Nature 312:604-608; Takeda et al, 1985, Nature 

20 314:452-454; Boss, U.S. Patent No. 4,816,397; Cabilly, U.S. Patent No. 4,816,567) by 

splicing the genes from a mouse antibody molecule of appropriate antigen specificity together 
with genes from a human antibody molecule of appropriate biological activity can be used. 
Or humanized" antibodies can be prepared (see, e.g., U.S. Patent No. 5,585,089). 
Alternatively, techniques described for the production of single chain antibodies (see, e.g., 

25 U.S. Patent No. 4,946,778) can be adapted to produce chaperone-specific single chain 
antibodies. 

[0085] Antibody fragments which contain deletions of specific binding sites maybe 
generated by known techniques. For example, such fragments include but are not limited to 
F(ab ! )2 fragments, which can be produced by pepsin digestion of the antibody molecule and 
30 Fab fragments, which can be generated by reducing the disulfide bridges of the F(ab')2 
fragments. Alternatively, Fab expression libraries may be constructed (Huse et al, 1989, 
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Science 246:1275-1281) to allow rapid and easy identification of monoclonal Fab fragments 
with the desired specificity for the chaperone. 

[0086] As discussed previously, these various methods can also be used to define sequences 
useful for the chaperone binding region of the candidate cyclic peptides. 

5 [0087] The antibody or antibody fragment specific for the chaperone can be attached, for 
example, to agarose, and the antibody-agarose complex is used in immunochromatography to 
isolate CCP-C complexes of the invention. See, Scopes, 1984, Protein Purification: 
Principles and Practice, Springer- Verlag New York, Inc., NY; Livingstone, 1974, Methods In 
Enzymology: ImmunoafSnity Chromatography of Proteins 34:723-73 1 . The isolation is 
1 0 carried out under conditions conducive to binding between the CCP-C complex and any 

bound target molecule. Such conditions are typically physiological conditions, but may vary 
depending upon the cell type used to screen the candidate cyclic peptides. Any target 
molecule bound to the immobilized CCP-P complexes may be dissociated there from using 
standard techniques and characterized, again using standard techniques. 

1 5 [0088] While the invention has been described by reference to various specific and/or 

preferred embodiments, skilled artisans will recognize that numerous modifications maybe 
made thereto without departing from the spirit and the scope of the appended claims. As a 
specific example, while the methods have been exemplified with reference to cyclic peptide 
candidate compounds, skilled artisans will recognize that virtually any type of molecule can 

20 be screened as a candidate compound according to the methods of the invention. Thus, the 
compounds screened can range from small organic molecules to large polymers and 
biopolymers, and can include, by way of example and not limitation, small organic 
compounds, saccharides, carbohydrates, polysaccharides, lectins, peptides and analogs 
thereof, polypeptides, proteins, antibodies, oligonucleotides, polynucleotides, nucleic acids, 

25 etc. The only requirement is that the candidate compound include two distinct regions or 

portions: one region or portion that binds a specific chaperone, and another region or portion 
that can vary in structure (putative target binding region). 

[0089] As another specific example, while the invention has been described in the context of 
intracellular processes and chaperones, skilled artisans will recognize that the principles of 
30 the invention can also be applied to the identification of cyclic peptides that interact with and 



34 



WO 2004/020981 



PCT/US2003/027370 



modulate extracellularly processes mediated by membrane-bound receptors and/or 
extracellular processes. Thus, serum proteins and other molecules can also be used as 
chaperones according to the principles of the invention. Suitable serum molecules useful as 
chaperones in the methods of the invention include, but are not limited to, plasma proteins 
such as serum albumin, lipoproteins, imm unoglobulins, fibrinogen, transferrin and alpha-1 
acid glycoprotein, as well as non-protein molecules such as heparin. 

[0090] All references cited throughout the disclosure are incorporated herein by reference in 
their entireties for all purposes. 
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CLAIMS 

What is claimed is: 

1 . A method of identifying a cyclic peptide capable of altering a phenotype of a cell, 
comprising the step of: 

administering to the cell a cyclic peptide comprising a chaperone binding 
region of known sequence and a target binding region of wholly or partially unknown 
sequence; and 

assessing whether a phenotype of the cell has been altered. 

2. The method of Claim 1 in which the cyclic peptide is composed wholly of gene- 
encoded amino acids. 

3. The method of Claim 1 in which the chaperone binding region binds an 
immunophilin. 

4. The method of Claim 3 in which the immunophilin is cyclophilin or an FK-binding 

protein. 

5. The method of Claim 4 in which the cyclophilin is selected from the group 
consisting of cypA, cypB, cyC and cycD. 

6. The method of Claim 4 in which the FK-binding protein is selected from the group 
consisting of FKBP12, FKBP13, FKBP25 and FKBP59. 

7. The method of Claim 3 in which the immunophilin is endogenous to the cell. 

8. The method of Claim 3 in which the immunophilin is exogenous to the cell. 

9. The method of Claim 1 in which the chaperone binding region has an amino acid 
sequence selected from the group consisting of Ala-Gly-Pro-He and Leu-Pro. 
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10. The method of Claim 1 further including the step of determining the sequence of 
the target binding region of the cyclic peptide. 

11. The method of Claim 1 in which the target binding region of the cyclic peptide is 
5 composed of from 4 to 10 amino acid residues. 

12. The method of Claim 1 in which the chaperone binding region and the target 
binding region of the cyclic peptide are contiguous. 

10 13. The method of Claim 1 in which the chaperone binding region and the target 

binding region of the cyclic peptide are spaced apart from one another via linkers, which may 
be the same or different. 

14. A method of identifying a cyclic peptide capable of altering a phenotype of a cell, 
1 5 comprising the steps of: 

admini stering to a plurality of cells a plurality of cyclic peptides, each of 
which comprises a chaperone binding region and a target biding region; 

identifying those cells exhibiting an altered phenotype (positive cells); and 
determining the sequence of at least the target binding region of the cyclic 
20 peptides of positive cells. 

15. The method of Claim 14 in which the chaperone binding region of each cyclic 
peptide of the plurality is the same. 

25 1 6. The method of Claim 14 in which each the target binding region of each cyclic 

peptide of the plurality includes a unique random sequence ranging from 4 to 1 0 amino acids 
in length. 

17. The method of Claim 14 in which the chaperone binding region binds an 
30 immunophilin. 

1 8. The method of Claim 17 in which the immunophilin is endogenous to the cells. 
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19. The method of Claim 17 in which the immunophilin is exogenous to the cells. 

20. The method of Claim 17 in which the immunophilin is a cyclophilin. 

5 21. The method of Claim 1 7 in which the immunophilin is an FK binding protein. 

22. A method of isolating a target capable of altering a phenotype of a cell, 
comprising the steps of: 

administering to a cell expressing a chaperone a cyclic peptide comprising a 
10 region capable of binding the chaperone and a target binding region of wholly or partially 
unknown sequence; 

if the cell exhibits an altered phenotype, contacting a lysate of the cell with an 
affinity reagent capable of specifically binding a the chaperone; and 

dissociating and isolating any target bound to the chaperone. 

15 

23. The method of Claim 22 in which the chaperone is an immunophilin. 

24. The method of Claim 23 in which the affinity reagent comprises an anti- 
immunophilin antibody. 

20 

25. The method of Claim 22 in which the affinity reagent is immobilized on a solid 
support. 

26. The method of Claim 22 which further includes the step of determining the 
25 identity of the isolated target. 

27. A method of identifying a target capable of altering a phenotype of a cell, 
comprising the steps of: 

administering to each of a plurality of cells a cyclic peptide comprising a 
30 chaperone binding region and a unique target binding region; 

isolating a cell exhibiting an altered phenotype; 

contacted said isolated cell with an affinity reagent that specifically binds the 

chaperone; 
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isolating there from any bound target; and . 
determining the identity of the isolated target. 

28. A cyclic peptide composed of from 4 to 30 amino acids, comprising: 

5 a chaperone binding region; and 

a target binding region of wholly or partially unknown sequence. 

29. The cyclic peptide of Claim 28 which is composed wholly of gene-encoded 
amino acids. 

10 

30. The cyclic peptide of Claim 28 in which the target binding region is a random 
sequence. 

3 1 . The cyclic peptide of Claim 28 which has the formula: 

15 

cycloCSer-X^X^^-X^^X^X^-L^Y^Y^Y^Y 4 -^ 
wherein: 

X 2 -X 9 are each, independently of one another, a gene-encoded amino acid, 
20 with the proviso that from 1 to 5 of X 2 -X 9 may be absent; 

L l is an optional linker composed of gene-encoded amino acids; 
Y^Y 4 taken together comprise the chaperone binding region; and 
L 2 is an optional linker composed of gene-encoded amino acids. 

25 32. The cyclic peptide of Claim 31 in which Y^Y 4 is selected from the group 

consisting of AGPI and LP . 

33. A library of cyclic peptides, each of which comprises a chaperone binding region 
and a target binding region, wherein the target binding region of each cyclic peptide is 
30 unique. 
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34. The library of Claim 33 in which the chaperone binding region of each cyclic 
peptide is the same, 

35. The library of Claim 33 in which the target binding region is composed of from 4 
to 10 amino acid residues. 

36. The library of Claim 35 which comprises from 20 4 to 20 10 members. 

37. A polynucleotide capable of expressing a cyclic peptide comprising: 

a first segment encoding a C-terminal intein domain; 

a second segment encoding a linear version of cyclic peptide, said cyclic 
peptide comprising a chaperone binding region and a target binding region; and 

a third segment encoding an N-terminal intein domain, wherein the first, 
second and third segments are arranged such that the polynucleotide expresses a cyclic 
peptide. 

38. The polynucleotide of Claim 37 which is single stranded. 

39. The polynucleotide of Claim 37 which further includes an inducible promoter 
operably linked to the first segment. 

40. The polynucleotide of Claim 37 in which the cyclic peptide further includes 
linkers intervening the chaperone binding region and the target binding region. 

41. A library of polynucleotides capable of expressing cyclic peptides, each 
polynucleotide of the library comprising: 

a first segment encoding a C-terminal intein domain; 

a second segment encoding a linear version of a cyclic peptide, said cyclic 
peptide comprising a chaperone binding region and a target binding region; and 

a third segment encoding an N-terminal intein domain, wherein said first, 
second and third segments are arranged such that the polynucleotide is capable of expressing 
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the cyclic peptide and wherein the target binding region of each expressed cyclic peptide is 
unique. 

42. A host cell comprising a polynucleotide according to Claim 41, or progeny 

5 thereof. 

43. The host cell or progeny of Claim 42 in which the polynucleotide is integrated 
into the genome of the cell or progeny. 

10 44. A library of cells in which each cell of the library comprises a polynucleotide 

capable of expressing a cyclic peptide comprising a chaperone binding region of known 
sequence and a target binding region of wholly or partially unknown sequence, wherein each 
cell comprises a unique polynucleotide. 

15 
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